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. Abstract 

We prove a structure theorem for the feasible solutions of the Arora-Rao-Vazirani SDP 
-^ relaxation on low threshold rank graphs and on small-set expanders. We show that if G is 

a graph of bounded threshold rank or a small-set expander, then an optimal solution of the 
Arora-Rao-Vazirani relaxation (or of any stronger version of it) can be almost entirely covered 
by a small number of balls of bounded radius. 

Then, we show that, if k is the number of balls, a solution of this form can be rounded with 
an approximation factor of 0(\/\og k) in the case of the Arora-Rao-Vazirani relaxation, and with 
a constant-factor approximation in the case of the fc-th round of the Sherali- Adams hierarchy 
c/2 ' starting at the Arora-Rao-Vazirani relaxation. 

The structure theorem and the rounding scheme combine to prove the following result, where 
G — (V,E) is a graph of expansion 0(G), A& is the fc-th smallest eigenvalue of the normalized 
Laplacian of G, and (f>k(G) = mindisjoint s ly ...,S k maxi<i<fc <fi(Si) is the largest expansion of any k 
disjoint subsets of V: if either Aj, > log 2 ' 5 k-<p(G) or 4>k{G) > log k ■ Vlog n ■ log log n ■ (j>{G), then 
the Arora-Rao-Vazirani relaxation can be rounded in polynomial time with an approximation 
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(^ ' ratio 0(y/logk). 

Stronger approximation guarantees are achievable in time exponential in k via relaxations in 

_il ' the Lasserre hierarchy. Guruswami and Sinop [GS13] and Arora, Ge and Sinop [AGS13] prove 

that 1 + e approximation is achievable in time 2°( fc ' poly(n) if either Afc > 0(G)/ poly (e), or if 
SSE n /j. > -y/kog k log n ■ 0(G)/ poly (e), where SSE S is the minimal expansion of sets of size at 
most s. 

X ■ 1 Introduction 
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We study approximation algorithms for the uniform sparsest cut problem in regular graphs 1 based 
on semidefinite programming. Let G = (V, E) be a r-regular graph. The expansion of a set S C V 

is the ratio 

\E(S,S)\ 
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The analysis can be extended to the case of general graph, in which case the problem becomes the conductance 
problem, and the condition on small-set expansion becomes a condition on small-set conductance. 



We want to find the smallest expansion of nonempty subsets of size at most n/2. We use 4>{G) to 
denote the value of the optimum, 

0(G) := min MS). 

S:0<\S\<n/2 

We will not define the uniform sparsest cut problem, but any approximation of expansion is, up to 
an additional factor of 2, an approximation of the uniform sparsest cut problem. 

Chawla et al. [CKK + 06] show that assuming unique games conjecture there is no constant factor 
approximation algorithm for a more general version of the problem known as non-uniform sparsest 
cut problem. The best polynomial time (or even sub-exponential time) approximation algorithm 
for uniform sparsest cut problem remains the algorithm of Arora, Rao and Vazirani [ARV09] , which 
achieves an 0(-\/logn) approximation factor, where n is the number of vertices. The Arora-Rao- 
Vazirani algorithm is based on a semidefinitite programming (SDP) relaxation, which we will refer 
to as the "ARV" relaxation. 

Within the long-term research program of developing better approximation algorithms for uni- 
form sparsest cut in general graphs, there has been much success in the past few years toward 
developing better algorithms for restricted classes of graphs. 

The technique of subspace enumeration [KT07, Kolll, ABS10] applies to the special class 
of graphs known as "low threshold rank" graphs. Let G = (V, E) be a r-regular graph, and 
C := I — A/r be the normalized laplacian matrix of G, where I is the identity matrix and A is the 
adjacency matrix. Let 

= Ai < A 2 < . . . A n < 2, 

be the eigenvalues of C A graph G has a low threshold rank, if A^ = S7(l) for a small number k. 
Low threshold rank graphs can be considered as a generalization of expander graphs. 

Arora, Barak and Steurer [ABS10] show that the technique of subspace enumeration devel- 
oped in the work of Kolla and Tulsiani [KT07, Kolll], achieves a 0(1/Afc) approximation in time 
2°( k ) poly(n). Later, Barak, Raghavendra and Steurer [BRS11] and Guruswami and Sinop [GS11] 
match this 0(1/Afc) approximation factor in time n°( k > by using an SDP relaxation that is derived 
from the Arora-Rao- Vazirani relaxation by k "rounds" of a procedure defined by Lasserre. The 
procedure starts from a SDP relaxation of a combinatorial problem which can be formulated as a 
0/1 integral program, and defines a family of relaxations with additional variables and constraints. 
The fc-th relaxation in this family has size n°( k >. We will refer to the fc-th relaxation in this family 
as L fc -ARV. 

These techniques are very powerful, and they lead to approximation algorithms for many 
constraint satisfaction problems including maxcut, sparsest cut, min uncut, graph coloring etc 
[AG11, GS12, GS13, OT12]. These algorithms run in time that is exponential in k, and they typi- 
callyprovide an approximation ratio of l/poly(Afc). A notable exception is the work of Guruswami 
and Sinop [GS13], who show that even if A& <C 1, but there is a gap between A& and 0(G), such 
as, say, Afe > 20(G), then a constant-factor approximation can be derived from Lfc-ARV. The 
approximation ratio can be made arbitrarily close to one if the ratio Afc/0(G)is sufficiently large. 
Because of the exponential dependency on k in the running time of all of the above algorithms, 
we may obtain a polynomial time algorithm only for graphs with a fast growing spectrum, i.e., if 
k = O(logn). 

Let SSE S (G) denote the minimal expansion of a subset of < s vertices of G, 

SSEJG) := min MS). 



Arora, Ge and Sinop [AGS 13] have recently announced a new rounding scheme for L^-ARV that 
achieves a (1 + e) approximation provided that SSE n / fc (G) > ^/log k • log n ■ L(k) /poly(e), where 
L(k) is the optimum solution of the k rounds of Lasserre hierarchy applied to the ARV relaxation. 
Roughly speaking, provided that there is a large enough gap (of order -^/log n log k) between the 
expansion in sets of size < n/2 and in sets of size < n/k, they can design a PTAS for the uniform 
sparsest cut problem. Qualitatively, this restriction is in the same spirit as a restriction on graphs 
in which A& is large. Indeed, [LOT12, LRTV12] prove that SSE n / fc < 0(\/\2k^og k), so that a 
requirement on small-set expansion is a stronger assumption than a requirement on A^. Quan- 
titatively, however, the quadratic gap in the results of [LOT12, LRTV12] makes the restriction 
on small-set expansion much weaker in some cases. For example, a cycle has expansion ~ 1/n, 
and SSE n /fc ss k/n, while A& « ^. This means that the condition A& > 20 is not satisfied for 
k = o(y/n), while the condition SSE n / fc > (log k) ^ 1 ' ■ logn • (f> is satisfied for k = O(logn). Sim- 
ilar to the low threshold rank graphs, [AGS13] may obtain a polynomial time algorithm only if 
SSE S (G) > poly(logn) • 0(G) for s = Q(n/logn). 

1.1 Our Results 

In this paper we design improved polynomial time approximation algorithms for uniform sparsest 
cut problem on graphs with a moderately growing spectrum, i.e., when the threshold rank can be 
as large as n°^ l >. We show that if there is a poly(log k) gap between A& and the optimum of ARV 
relaxation, then the solution of ARV relaxation can be rounded with an approximation factor of 
0(\/log k) in polynomial time. 



Theorem 1.1. There are universal constants ci,C3 and a randomized rounding algorithm such that 
for any graph G, if Xk > — ■ log k ■ sdp, then the Arora- Rao -Vazirani relaxation can be rounded 
with an approximation ratio 0(\/log k). Here, sdp is the optimum value of the ARV relaxation. 

For example, if k = n°^ l \ we get a o(\/\ogn) approximation for uniform sparsest cut. Conse- 
quently, to improve the 0(\/\ogn) approximation of [ARV09] it is sufficient to find an improved 
approximation algorithm for the high threshold rank graphs. 

We can also improve the ARV rounding if the graph is a small set expander, i.e. SSE n /fc(G) 3> 
0(G), and in fact a weaker condition, based on a parameter that we define next, suffices. The order 
k expansion of G is the smallest number (j)k(G) such that we can find k disjoint subsets of vertices 
each of expansion at most 0fc(G), 

MG) ■=.... min max .#(#). 

disjoint Di,...,AfcC\/ l<i<fc 

Note that for any graph G, (f>k(G) > SSE n / fe (G), and <f>k{G) > A^/2 but A& and SSE n / fc (G) are 
incomparable. Consequently, having a lower-bound on <fik(G) is weaker than having lower bounds 
on any of Afc or SSE n/ / fc (G) 2 . We show that if there is a O(\ogk^f\ogn) gap between 4>k{G), and 
the optimum of ARV relaxation, then the solution of ARV relaxation can be rounded with an 
approximation factor of 0{\J\ogk) in polynomial time. 

Theorem 1.2. There are universal constant 02,03 and a randomized rounding algorithm such that 
for any graph G, if 4>k{G) > — ■ log k ■ \/log n • log log n ■ sdp, then the Arora-Rao- Vazirani relaxation 



2 For example, if G is a complete graph with a separated vertex, then cj)k(G) — O(l) for any k > 2 whereas 
SSE 3 (G) = for any s > 1. 



can be rounded with an approximation ratio 0(\/\ogk). Here, sdp denotes the optimum value of 
ARV relaxation. 

Note that Theorem 1.2 is not necessarily stronger than Theorem 1.1 because the gap between 
4>k(G) and sdp is a function of n that can be significantly larger than k. 

In addition to the above results we can obtain a constant factor approximation algorithm if 
we are given a feasible solution of the fe-th round of the Sherali- Adams hierarchy starting at the 
Arora-Rao-Vazirani relaxation. 

Theorem 1.3. There are universal constants ci, C2,C4 and a randomized rounding algorithm such 
that for any graph G , if X k (G) > §Mog 2 fc-SA(2k) or if <f> k (G) > g • Vlog k log n ■ log log n ■ SA(2k) ; 
then the 2k-th round of the Sherali- Adams hierarchy of the Arora-Rao-Vazirani relaxation can be 
rounded with a constant factor approximation. Here, SA(2k) denotes the optimum value of the Ik 
rounds of Sherali-Adams hierarchy applied to ARV relaxation. 

The above result is weaker than results of Arora, Ge, Guruswami and Sinop. In [GS13], Gu- 
ruswami and Sinop achieve an arbitrarily good approximation, instead of a constant factor approx- 
imation, assuming that the ratio between A& and the optimum of the relaxation is a sufficiently 
large constant, while we need the ratio to be of the order of log k. In [AGS13], Arora, Ge and 
Sinop achieve an arbitrarily good approximation, instead of a constant factor approximation, as- 
suming that the ratio between SSE n /^(G) and the optimum of the relaxation is at least order of 
\/log fclogn, while we need it to be order of y^og k log n log log n. On the other hand, our results 
hold for the weaker Sherali-Adams relaxation, are proved via rather different techniques and, in 
the case of the second result, are based on an assumption on (pk(G) rather than SSE n / k (G). 

1.2 Structure Theorems and Improved Rounding 

A feasible solution of the ARV relaxation for a graph G = (V, E) is an assignment of a vector x„ 

to every vertex v £ V. The vectors are normalized so that Y^ u veV H Xu — x ^ll 2 = n2 an( ^ the cos ^ OI " 

the solution is 

sdp := } ||x n -x„|| 2 

2r ■ n *— ' 
(u,v)eE 

where r is the degree. (Furthermore, the distance function d x (u,t>) := ||x n — x„|| 2 satisfies the 
triangle inequality.) 

Our proofs follow from structure theorems on feasible solutions of ARV relaxation. We show 
that if Afc or (j)^ are sufficiently larger than sdp, then the SDP solution can be almost entirely 
covered by a small number (at most 2k) of balls of small radius. Then, we use this property to 
design improved rounding algorithms for graphs with moderately growing spectrum or small set 
expanders. 

Our structure theorems are in the spirit of on a recent work of Kwok et al. [KLL + 13]. They 
show that the nearly-linear time "sweep" algorithm based on the eigenvector of the second smallest 
eigenvalue of C has approximation ratio 0(fc/\/A&)- Their paper is based on a structure theorem 
showing that the eigenvector of the second eigenvalue of the Laplacian must be at distance at most 
0(y/\2/\k) from a vector whose entries contain only 2k distinct values. 

We show a similar structure theorem for the solutions of ARV relaxation. Our structure 
theorems show that there are universal constants c±,C2 such that for any < 8 < 1, if A& > 



c\ ■ log 2 k ■ sdp /8 or if 4>k{G) > c%- \/log n log k ■ log log n ■ sdp /S, then we can find 2k sets Xi , . . . , T^ 
of diameter (according to the distance d x (u, i>) := ||x u — x^H 2 ) at most 5 covering almost all vertices 
of G. 

If the solution of ARV relaxation has exactly 2k points, then we can give a O(^logk) approxi- 
mation using ARV rounding, or a constant factor approximation using 2/c-levels of Sherali- Adams 
hierarchy (2fc-levels of Sherali- Adams hierarchy enforces that on every 2k vertices we have an inte- 
gral cut). Our improved rounding algorithms provide a robust version of these facts. 

Our improved rounding scheme for ARV shows that if we can cover almost all of the vertices 
by 2k sets of diameter 5, then provided that 5 < c^/^/log k, where C3 is a universal constant, we 
can find in polynomial time a cut of expansion at most 0(\/logk ■ sdp). This proves Theorem 1.1 
and Theorem 1.2. 

If the solution {x v } v ^v as above is feasible for 2k rounds of Sherali- Adams applied to ARV, 
then, provided 5 < a, where C4 > is a universal constant, we can find in polynomial time a 
solution of cost at most O(sdp). This proves Theorem 1.3 

1.3 Techniques 

1.3.1 The Structure Theorems 

To prove our structure theorems, our general idea is to divide the ambient space of the vectors 
x v into regions of small diameters, and to look at the 2k regions with the most elements of the 
well-spread set guaranteed by the argument of Leighton and Rao. If such regions cover almost all 
the vertices in the well-spread set, then the covered vertices are themselves a well-spread set, and 
we are done. Otherwise, we find either k disjointly-supported functions /j : V — >■ R all of small 
Rayleigh quotient, leading to a contradiction to the assumption on A&, or k disjoint sets each of 
small expansion, leading to a contradiction to the assumption on </>&. 

To implement this idea in the case in which the assumption is on A^, we use, as in [LOT12], 
a "padded decomposition" of the ambient space of the vector. The problem with low-diameter 
padded decomposition, however, is that, if h is the dimension of the space, we will lose a factor of 
h 2 in the Rayleigh quotient (see Proposition 4.2). The ARV solution {x v } ve y that we start from 
may lie in a n-dimensional space, and so we would need an assumption of the form sdp > n 2 • A& 
in order to prove our result. To avoid this loss, we first map our solution {x v } v( zy to an 0(logA;)- 
dimensional space. The resulting solution {z„}„ g y may not satisfy the triangle inequality any 
more, but it has approximately the same cost as the solution {x v } vf zy, and for all but a o(l/A; 2 ) 
fraction of the pairs u, v the distances ||x u — x^H 2 and \\z u — z v \\ 2 are approximately the same. 
(But, we emphasize again, the distances \\z u — z v \\ 2 may not satisfy the triangle inequality.) Using 
a padded low-diameter decomposition, and the assumption A& > c\ ■ log k ■ sdp /5, we are then able 
to cover almost all of the vertices using 2k regions that have diameter at most 5/2 according to the 
distances \\z u — z v \\ 2 . We can then argue that we can also cover almost all the points using 2k sets 
whose diameter is at most 8 according to ||x u — x^lj 2 , or else we have Q,(n 2 /k 2 ) pairs (n, v) such 
that the dimension-reduction distorted their distance by a constant factor, which happens with low 
probability. 

To cover a well-spread set under the assumption on (p^ we have to overcome an additional 
problem: each edge will be cut with probability proportional to \\z u — z v \\ ~ ||x u — x„|| in the 
low-diameter decomposition, but it contributes only ||x u — Xj,|| 2 to the cost sdp of the solution 
{x v } v< zy, so that the fraction of cut edges (and consequently the assumption on 0^) would be 



proportional on -y/idp rather than sdp. We resolve this difficulty by first using a result of Arora, 
Lee and Naor [ALN08] to map the solution {x^} to vectors {y v } V £v such that for all pairs (u,v) 
we have ||x n — x„|| 2 < ||y« — y v \\ < 0(\/logn) • ||x u — x„|| 2 . Then we map to a solution {z v } v& y m a 
0(log/c)-dimensional space such that for all but o(n 2 /k 2 ) pairs u,v we have \\y u — Yv\\ < 2||z n — z v \\. 
Using this solution, we use the assumption 4>k > c 2~ \/l°g n log k ■ log log n ■ sdp jb to find 2k regions 
of diameter 5/2, according to the distances \\z u — z v \\ that cover almost all the vertices. Reasoning 
as before, this means that we also have 2k sets of diameter 5, according to the distances ||y M — y„|| 
that cover almost all the vertices. Since ||x u — x^ 1 1 2 < ||y M — y„ || for all pairs u, v G V, the diameter 
of the 2k sets is at most 5 according to the distances ||x u — x^H 2 . 

1.3.2 The Rounding Schemes 

An argument that goes back to Leighton and Rao [LR99] shows that either a simple rounding 
algorithm succeeds in finding a cut of expansion at most 4 sdp, or else there is a set A C V that is 
(4, 1/16)- well- spread. 
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Definition 1.4. Let {x v } v ^y ^ e an assignment of vectors to vertices such that Y2uv ll x « — x 
n 2 . We say that a set A C V is (a, /3) -well- spread if for all u,v G A, 

||x u X^ || v. ct, 
} y ||x„ -x^H 2 > (3-n 2 . 

u,v£A 

Let {x v } V £v be a feasible solution to the ARV relaxation, and let A be a (0(1), 0(l))-well- 
spread set of vertices. For a set U C V, and v £ V, let d^(v, U) := min ue ;y ||x u — x^H 2 . If we can 
find a set U C A such that J2 U v \dx.(u, U) — d^(v , U)\ > n 2 /5, then we can find a set of expansion 
at most 0(sdp /5). Arora, Rao and Vazirani [ARV09] show that such set can always be found, with 
5 = J)(l/Vlogn). 

Using a refinement of the result of [ARV09] proved by Arora, Lee and Naor [ALN08] , we can 
show that if the solution is such that the well-spread set {x v } v£ a occupies only k distinct points, 
then we can have 5 = Q(l/^logk), and that this remains true if {x^}^ e ^4 can be covered by k sets 
of diameter o(l/y/logk). 

Our other rounding scheme assumes that {x v } ve y is (part of) a solution that is feasible for 
2k rounds of Sherali- Adams applied to ARV. If a well-spread set A is covered by 2k low-diameter 
sets At, ... , A2k, let C = {a±, . . . , a2k} be a set of 2k vertices, with each aj G A,- L chosen arbitrarily 
from each region. The feasibility for 2k rounds of Sherali Adams implies that there is a probability 
distribution over metrics P(-, •) (described as part of the feasible solution) such that for every pair 
(n, v) we have ||x u — x^H 2 = K[D(u, v)], and we also have with probability 1 that ^ ul) I)(tt,D) = 
^2u v ll x " — x f IP = n2 an d that there is a partition Cx, Cr of C such that P(oj, aj) is zero if a^, aj 
are both in Cl or both in Cr, and it is the same positive value for all pairs in Cl x Cr. (That is, 
with probability 1, T>(-, •) is a cut metric on C.) Overall, sampling from this distribution, we find, 



with constant probability, a distance function T> such that 

Y^ T>(u,v) < 0(1)- J2 ll x «-^ll 2 
(u,v)eE (u,v)eE 

^2 V{u,v) > n(n 2 ) 

u,vGA 

^2v(u,C) < 0(5 -n) 

ueA 

which is enough, for small enough constant 6, to round the solution and find a cut of expansion at 
most O(sdp). 

2 Preliminaries 

2.1 SDP Relaxations 

The Arora-Rao-Vazirani relaxation is defined as follows; the variables x„, one for each vertex, are 
vectors. 



mm 

1r ■ n 

(u,v)eE 



1 V^ II _ m2 

• n *-^ 
(u,v)eE 

subject to 2, ll x « — x fll = n 

U,VdV 

||x u -x^l 2 < ||x M - x„,|| 2 + \\x. w - x„|| 2 \/u,v,w G V 

Next, we show that this SDP is indeed a relaxation for the uniforms sparsest cut problem. Let 
S C V with size s := \S\ be the optimum solution, i.e., < s < n/2 and (f>(S) = <fi(G). Then, for 
every v G V, we define 

_ if v G S, 



Xv = 1 V s ( n ~ s ) - (2) 

[0 otherwise. 

Since the above assignment defines a cut, it satisfies the triangle inequality. Furthermore, since 
s < n/2, 

2r ■ n *— ' 2r ■ n sin — s) 

(u,v)eE v ' 

Therefore, (1) is a relaxation of the uniform sparest cut problem. 

The following relaxation is a simplification of the fc-th round Sherali- Adams strengthening of 
ARV. There is a vector variable x„ for every vertex v G V, then there are also 2 • Qj) • Q) additional 
real variables d R,b (u,v), one for every two vertices u, v G V, every subset i? C V of cardinaility fc, 
and every map b : R — > {0, 1}, and finally there are 2 • P£) real variables p > . 



1_ V" ii 

r ■ n *— ' 



mm > ||x n -x 1 ,| |2 

2 r . n t — e 

(u,v)eE 
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(3) 



subject to y, ll x « ~~ x t>l 

||x„ — x^|| 2 < ||x u -x w || 2 + ||x w -x„|| 2 \/u,v,w G y 

d R ' 6 (u, u) < d R > b (u, w) + d fi ' V, v) V«, u, u; G 7 

||x u -x„|| 2 = ^ d R ' b (u,v) \fu,veWR 

b:R-+{0,l} 

J2 P R ' b = i vi? 

&:.R-»{0,1} 

p fi ' fe > Vi?, 6 

d fl ' ft (u, «) = Vu, tA/i?, 6. b(v) = b{u) 

The relaxation (3) can be interpreted in the following way. For every subset R C V of k vertices, 
we have a probability distribution p ' over the possible cuts b : R — > {0, 1} of R. For each such 
R and 6, we have a distance function d R ' 6 that satisfies the triangle inequality, and which has the 
following property. Fix any R, and consider the following distribution on metrics: pick a random 
6 : R — y {0, 1} with probability p R ' b , and define T>(u,v) := -^d R,b {u,v). Then each £>(■, •) in this 
sample space satisfies the triangle inequality and, for all pair of vertices u,v G V, 

E[V(u,v)} = ||x u -x,,|| 2 , (4) 

J^V(u,v) = n 2 . (5) 

u,v 

Furthermore, T>(-, •) defines a cut metric over R. 

Next, we show that (3) is a relaxation of the uniform sparsest cut problem. Let S be the 
optimum solution. Then, we assign the same value as in (2) to each vector x„. For each set R C V, 
let bn : R — > {0, 1} be the indicator of S in R, that is for any v G R, 

b R {v):=\ l ' liveS ' 
I otherwise. 

Then, for any b : R — > {0, 1}, we let p R,b := I [b = bn]. This defines a probability distribution for 
each set R. Furthermore, for any u,v G V let d R,b (u,v) := I [b = 6r] ||x u — x^|| 2 . It is easy to see 
this satisfies all of the constraints of (3). 



2.2 Notations 

Let {x v } v€ v £ R m be be a sequence of vectors assigned to the vertices of G. In this paper we will 
use two types of distance functions on these vectors. For any pair of vertices u, v € V, we use 

d^(u,v) : = llx^-x,,!! 2 
d x (u,v) := ||x u -x„||. 

Note that d x is just the Euclidean distance, and is always a metric, but d\ is not necessarily a 
metric. In fact, if d\ is a metric, we say vectors {x v } v ^v form a negative type metric. 
For a distance function d(., .), we define the energy of the graph as follows: 

£(d) ■= ^ V d(u,v), 

2r ■ n t—' 

(u,v)eE 

For a distance function d(., .), and a set SCK, the diameter of S is the maximum distance between 
the vertices of S, 



For every u € V, let 



diam(5, d) := max d(u,v). 

u,vES 



d(u,S) := min d(v , u) . 



For any u £ V and r > 0, the ball of radius r about u is the set of vertices at distance at most r 
from u, 

Bd{u,r) := {v : d(u,v) < r} 

2.3 Spectral Graph Theory 

For a r-regular graph G = (V,E), the Rayleigh quotient of a function / : V — >■ R, is defined as 
follows, 

_ m _ Ek^(/(^)-/(^)) 2 

A proof of the following statement can be found in [KLL + 13, Lem 2.3]. 

Fact 2.1. For any graph G, and any k disjointly supported functions /i, . . . , /& : V — > R, 

A fc < 2 max 7^(/j). 
i<«<fc 

2.4 Random partitions of metric spaces 

We now discuss some of the theory of random partitions of metric spaces. Let d(., .) be a metric 
defined on the vertices of G. We will write a partition P of V as a function P : V — > 2 mapping 
a vertex v G V to the unique set in P that contains v. 

For 5 > 0, we say that P is 5-bounded if diam(S', d) < 5 for every S £ P. We will also consider 
distributions over random partitions. If V is a random partition of V, we say that V is 5-bounded 
if this property holds with probability one. 



A random partition V is (<5, a, e)-padded if V is (5-bounded, and for every v £ V, we have 

F[B d (y,6/a) CP(o)] > e. 
A random partition is (5,L)-Lipschitz if "P is 5-bounded, and, for every pair ii^eF, we have 

d(u, v) 



\V{u) / P(v)] < L 



5 



Here are some results that we will need. The first theorem is known, more generally, for doubling 
spaces [GKL03], but here we only need its application to M fc . See also [LN05, Lem 3.11]. 

Theorem 2.2. If vertices are mapped to M, k , and d(., .) is the Euclidean distance between the 
vertices, then for every 5 > and e > 0, (V, d) admits a (<5, O(kfe), 1 — e) -padded random partition. 

The next result is proved in [CCG + 98]. See also [LN05, Lem 3.16]. 

Theorem 2.3. // vertices are mapped to M. k , and d(., .) is the Euclidean distance between the 
vertices, then for every 5 > 0, (V, d) admits a {5,0(\fk))-Lipschitz random partition. 

3 Statement of Our Results 

Theorem 3.1 (Afc Structure Theorem). There is a universal constant c\ such that for any sequence 
of vectors {x. v } v€ y , and e, 5 > 0, if 

X k >c 1 .\ogH.£(dl). l -^l^, 

then there are 2k sets T\, . . . , T^ with diameter diam(Tj, d^) < 5 covering 1 — e fraction of vertices. 
Furthermore, we can find these sets by a randomized polynomial time algorithm. 

Theorem 3.2 (</>£. Structure Theorem). There is a universal constant ci such that for any sequence 
of vectors {x v } ve y that form a metric of negative type, and e, 5 > 0, if 



4> k > c 2 ■ Vlog k ■ log n ■ log log n ■ £((%.) ■ ^- , 

e • o 

then, there are 2k sets T\, . . . ,T2k with diameter diam(Tj,c^) < 5 covering 1 — e fraction of vertices. 
Furthermore, we can find these sets by a randomized polynomial time algorithm. 

Theorem 3.3 (Rounding ARV). There are universal constants c^,e\ > and a randomized poly- 
nomial time rounding algorithm such that the following holds. 

For any feasible solution to (1), {x v } v ^v of cost sdp, if there are 2k sets T\, . . . , T<ik of diameter 

< c^/^Jlogk covering (1 — e\)n vertices, then the rounding algorithm finds a cut of expansion at 
most sdp • 0(\/log k) with high probability. 

Theorem 3.4 (Rounding Sherali-Adams Relaxations). There are universal constants c\,e\ > 
and a randomized polynomial time rounding algorithm such that the following holds. 

For any feasible solution to (3), {x v } v ^v of cost sdp, if there are 2k sets T±, . . . , T<ik of diameter 

< C4 covering (1 — e\)n vertices, then the rounding algorithm finds a cut of expansion at most 
O(sdp) with high probability. 

10 



4 The Structure Theorems 

4.1 Afc structure theorem 

In this section we prove Theorem 3.1. Using the following lemma we project the solution of SDP 
to a 0(log k) dimension space. This helps us to decrease the loss in partitioning to a function that 
is only depend (poly-logarithmically) on k. We show that the solution of SDP only suffer by a 
constant factor and most of the pair of vertices will not get distorted. 

Lemma 4.1. For any sequence of vectors {x v } v£ y C R m , and e > 0, there exists vectors {z v } v€ y G 
M. h where h = ©(log 1/e) such that 

£(d z ) < 4£(d x ), 
£(d 2 z ) < 4f(4), 
\{(u,v) : [|xu -x„|| 2 > 2\\z u -z v \\ 2 }\ <en 2 . 

Proof. Let gi,g2, ■■■ -,9k be i.i.d. m-dimensional Gaussians, and consider the random mapping 

r m ^ : R fc — > W h defined by T m ^{x) = h~ l ' 2 ({gi, x), (§2, x), . . . , (gh, x)). Then we have the following 
basic estimates (see, e.g. [Mat02, Ch. 15] or [LT11, Ch. 1]). For every x G R m , 

E[||r m ,,(x)|| 2 ] = ||*|| 2 , (6) 

E[\\T m:h (x)\\} < \\x\\. (7) 

Equation (7) follows from (6) and the Jensen's inequality. Also for every a E (0, k], 



(1 - a)\\x\\ z < \\T m , h (x)r < (1 + a)\\x\\ z \ > 1 - 2e~ a ^ , (8) 

Choose h = 0(logl/e) such that 2e~ ' 48 < e/4. Then, by (8) for each u,v G V, we get 

P [||r m , h (x« - x u )|| 2 < ||x„ - x u || 2 /2] < e/4. 
By linearity of the r m ,/i operator, 

J- m,h(P^v x uJ = ^ m,hv x u) J- m,/iv x uj- 

Therefore, by (6) and (7) and Markov's inequality with probability 1/4 we get vectors {r mi /j(x t ,)}„ e y C 
R that satisfies all inequalities in lemma's statement. □ 

The following proposition is the main part of the proof. Here we show that if we can not find 
2k regions of small diameter covering almost all of the vectors {z v } v£ y then A& must be very large. 

Proposition 4.2. There is a universal constant d x such that for any sequence of vectors {z v } ve y G 
R h , andO < e,5 < 1/2 if 

h 2 

then there are 2k sets Si, . . . , S^fc such that diam(Si,d z ) < 5 for 1 < i < 2k, and 

2k 

||JSi|>(l-e)n. 



i=l 
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Proof. We show that if we can not find 2k sets of small diameter covering almost all of the vertices, 
then we can construct k disjointly supported function of Rayleigh quotient 0(h 2 £(d 2 )/e 3 5 2 ). Then 
by Fact 2.1 we get a contradiction to the assumption that A& is large. The proof has three main 
steps: random partitioning, merging into dense regions, and construction of disjointly supported 
functions. 

i) Random partitioning. By Theorem 2.2 there exists a (<5, a, 1 — e/4)-padded random par- 
titioning for the metric space (V, d z ), for a < h/e. For S C V, the interior of S is the set 

S:={v:B dz (v,5/a)OS}. 

By linearity of expectations and Markov inequality, with probability at least 1/2 we sample a 
partition P such that, for each S £ P, diam(5, d z ) < 8, and 

£ \S\ > (1 - e/2)n. (9) 

SeP 

Choose a labeling S± , S2 , ■ ■ ■ of the sets of P such that 

|£i|>|£ 2 |>...>|5| P ||. (10) 

If Yli=i Pi I — (1 — e ) n > then S±,..., S^ form 2k sets of diameter 5 covering (1 — e)n vertices, 
and we are done. Otherwise, we get a contradiction to the assumption that Afe is large. Using 
equation (9), for the rest of the proof we assume 

Y. 1^1 ^ €n i 2 - ( n ) 

i>2fc 

Next, we merge the sets in P greedily into 2k disjoint sets T\, . . . , T 2 fc such that mini<j<2fc |Tj| > 
||. Then, we construct 2k disjointly supported functions /1, . . . , f2k where supp(/j) C Tj. Finally, 
We show that the best k of these 2k functions have Rayleigh quotient 0(a 2 £(d 2 .)/e5 2 ). 

ii) Merging. We start by constructing T\, . . . ,T2k- First let Tj = Si, for 1 < i < 2k. Then, 
iteratively, for each i > 2k we merge Si with the set with smallest interior, argmin r . \Tj\. In the 
next claim we show that, by the end of the algorithm, all T- h have at least en/8k vertices in their 
interior. 



Claim 4.3. 



min \TA > en/8k. 

Ki<2k 



Proof. If |i?2fc| > en/8k, then we are done, since, by equation (10), for all 1 < i < 2k, 

\fi\ > \Si\ > \s 2k \. 

Otherwise, we have S2k+i < en/8. By equation (11), 



2 k 



J2\Ti\Si\>en/2. 



i=X 
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Therefore, for some 1 < j < 2k, \Tj \ Sj\ > en/4. Let Si be the last set that is merged with Tj. 
Since Si is merged with Tj , Tj is the set with the smallest interior at the time of merging with Si . 

Therefore, 

. - - e . - g =. en en en 

D 

iii) Construction of disjointly supported functions. It remains to construct ft disjointly 
supported functions of small Rayleigh quotient. For each 1 < % < 2k, let /j : V — >■ M be defined as 

/i(u) := max(0, 1 - a • d z (v,Ti)/6). 

Note that for v £ T, fi(v) = 1, and the value of /«(f) decreases smoothly with the distance of v 
from Ti, in particular, /j(u) = 0, for v ^ Tj. It follows from the above definition that each fi is 
a/<5-lipschitz w.r.t. d z (., .), that is, for all u,v EV and 1 < i < 2k, 

OL 

\fi(u) - fi(v)\ < -d z {u,v). 
Since Ti, . . . , T 2 k are disjoint, f\, . . . , f 2 k are disjointly supported. Therefore, 

2k 9 2 

E E (/«(«)- /i(«)) a = ^ E <£(«.«)• 

*=l (u,v)eE (u,v)eE 

By Markov inequality, after relabeling, we get fe disjointly supported functions fi,- ■ ■ , fk such that 
for all 1 < i < k, 

E (/<(u) - /i(t;)) a < ^ E <£(«.*)■ 

(«,«)eB (u,v)eE 

By an application of Fact 2.1 we get 

x „ 9 _,,, 9 E Me E(/iM-/i(«)) 2 . JEmAj 

A fc <2max^(/ 4 ) = 2max ^JT^ ^ ^^ 

64a 2 £ (eg) /i 2 £(cg) 

e-5 2 ~ e 3 -5 2 

Therefore, A& < c'{h 2 £(d 2 .)/e' i 5 2 for a universal constant c'/. Letting c^ > c^' we get a contradiction. 

□ 

The following lemma is the last step of the proof. We show that sets of small diameter in 
0(logA;) dimension space corresponds to sets of small diameter in the original space of the ARV 
solution. 

Lemma 4.4. Let d{., .),d'(., .) be two distance functions on V. Let S\,. . . ,S 2 k Q V be such that 
diam(Sj, d) < 5 for all 1 < i < 2k, and let S := U^S*. If, for e > 0, 



\{(u,v) : d'(u,v) > 2d(u,v)}\ < 



e 3 n 2 



64k 2 ' 

then, there are 2k sets Ti, . . . , T 2 k of diameter 4<5 w.r.t. to d' that cover all but en of the vertices of 
S. 
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Proof. Wlog we assume that Si,... , S2k are disjoint (we can simply delete multiple copies of a 
single vertex) . We say a set Si is good, if there exists u £ Si such that 

\B dl (u,2S)nSi\>(l-e/2)\Si\, 

and it is bad otherwise. We use A to denote the indices of good sets. By the above definition, for 
each good set Si, there is a ball B#(u, 25) such that \Bd'(u, 25) n Si\ > (1 — e/2)\Si\. We define Tj 
to be such ball if i £ A and Tj = otherwise. Then, 

2fc 2fc 

| U Ti\ > J2 \Ti n Si\ > £ \Si\{l - e/2). 

i=l i=l i€A 

Since |T| < n, to prove the lemma it is sufficient to show that 

^2\S l \>\T\-en/2. (12) 

ieA 

Equivalently, we can show X^aI^I — en /2- We say that a pair of vertices u,v is distorted, if 
d'(u,v) > 2d(u,v). Observe that each bad set Si contains e|S'j| 2 /4 distorted pairs. This is because 
for each vertex u £ Si we have Si C Bj,(u,5), but at least e|5j|/2 of the vertices of Si are not 
contained in the ball B#{u, 25). 

Since Si,..., S2k are disjoint, the total number of distorted pairs is at least, 

v 4§! > e( Ei iA \Si\ y _^_ (VrnV 

*-o 4 ~ ±\2k-\A\ ) ~ 16/fc 2 I ^' 'J ' 

i^A ' ' \igA / 

On the other hand, by the lemma's assumption, the number of distorted pairs is at most e 3 n 2 /64/c 2 . 
This implies that Yli&A \&i\ — en /2 which proves (12) and 

2k 

Y,\Ti\>\S\-en. 

i=i 

D 

Proof of Theorem 3.1. First, by Lemma 4.1, there are vectors {z v } v ^y ^ K > where /i = 
6(logA;logl/e), such that £(ti 2 ) < 4£(ci 2 ), and 

3 2 

\{(u,v) : <£(«,«) > 2(i 2 (^)}| < -^. (13) 



512fc 2 ' 

„ ^TV,™ o^^ Q P(j2\ <- A?(&\ +V, Q ooc 

(logfclogl/e)* 



Let ci := -jq — , j 1 i / ^ • Then, since £(d^) < 4£(ci 2 ), the assumption of the theorem implies 



X k > c[ - £{dl). 

(e/2)3( v / V4) 2 

By Proposition 4.2, there are 2k sets Si, . . . ,S2k such that diam(Sj,<i 2 ) = (diam(S'j,d z )) < 5/A 
for 1 < i < 2k, and they cover (1 — e/2)n vertices of G. Finally, by (13), we can apply Lemma 4.4 
with d = d\, d! = d 2 , we get 2k sets Ti, . . . , T 2 k with diameter diam(Tj, d 2 ) < 5 that cover 

2k 

(1 - e/2)| (J Si I > (1 - e/2)(l - e/2)n > (1 - e)n 



i=l 



14 



vertices of G. Observe that all steps of the analysis are constructive, so we have a polynomial time 
randomized algorithm for finding T\ , . . . , T 2 k ■ D 

4.2 <pk structure theorem 

The proof of the next proposition is very similar to the proof of Proposition 4.2. The main difference 
is that we use Lipschitz partitioning instead of padded partitioning. This is because we just want 
to find k disjoint non-expanding sets, so we just need that very few edges are cut by the random 
partitioning. 

Proposition 4.5. There is a universal constant c' 2 such that for any sequence of vectors {z v } ve y € 
R h , andO < e,8 < 1/2, if 

4>k>c' 2 r£(<2z), 

e • o 
then there are 2k sets T\, . . . , T 2 k such that diam(Tj, d z ) < 5 for 1 < i < Ik, and 

2k 

\\j T i\ >(l~eK 

Proof. We show that if we cannot find 2k regions of small diameter covering almost all vertices 
then we can construct k disjoints sets of small expansion. 

i) Random partitioning. By Theorem 2.3 (V,d z ) admits a (6, a)-Lipschitz random partition 
V for a = 0(y/h) such that for all pair of vertices u, v 

P[V(u)^V(v)]<a dz{u,v) 



5 

By linearity of expectation and Markov inequality, with probability at least 1/2 we sample a 
partition P such that for all S G P, diam(5', d z ) < 5, and 

\{(u,v)eE:P(u)^P(v)}\<^ Y, d ^ v )- ( 14 ) 

(u,v)£E 

Choose a labeling Si,Sz, . . . of the sets in P such that 

\Si\>\S 2 \>...>\S\ P \\. (15) 

If Yli=i \&i\ — (1 — e ) n > then S\, . . . , S 2 k are 2k sets of diameter 5 covering (1 — e)n vertices and we 
are done. Otherwise, we get a contradiction to the assumption that <pk is large. For the rest of the 
proof we assume 

J^ \Si\ > en. (16) 

i>2k 

Next, we merge the sets into 2k disjoint sets 2\, . . . ,T 2 k such that each Tj covers at least en/ 4k 
vertices of G. Then, we show that at least k of these sets have expansion 0{o£(d z )/eS). 

ii) Merging. This part is very similar to the Merging part of Proposition 4.2. We start by 
constructing T\, . . . ,T 2 k- First let Tj = Si for 1 < % < 2k. Then, iteratively, for each i > 2k merge 
Si with argminp. |Tj|. We show that by the end of this procedure 

en 
mm ITA > —. 
l<i<2k 4fc 
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In particular, if |<S2jfc+i| > en/Ak, then by (15) we get the above equation. Otherwise, by equation 
(16), there is a set Tj such that \Tj \Sj\ > en /2k. If Si is the last set merged with Tj, by the time 
Si was merging with Tj, Tj has the least number of vertices. Therefore, 

min \Ti\ < IT,- \ SA > \Tj \ Sj \ Si\ > en/\k. 

l<i<2k i J \ i ijnjni 

iii) Construction of non-expanding sets. By equation (14) 

2fc 

Y / \E(T i ,T\)\<\{(u,v)€E:P(u)^P(v)}\<^ £ £(«,«). 

«=1 (u,v)EE 

Therefore, by Markov inequality, after relabeling, we get k disjoint sets T±,. . . , T^ such that for all 

a • |ij| a • en/4fc e • o e • o 

Therefore, <^ < c^' • Vh£(d z )/e5 where c 2 ' is a universal constant. Letting c' 2 > d^ we get a 
contradiction. D 

Proof of Theorem 3.2. Since {x v } v ^y form a negative type metric, by [ALN08], there are vectors 
{y v }v£V such that for all u,v £ V, 

\\*v - x u || 2 < ||y„ - y u || < a||x„ - xj 2 . (17) 

where a < \/Iog n • log log n. By above equation £{d y ) < a£(d 2 ). By Lemma 4.1 we can reduce 
the dimension to 0(logA;). There are vectors {z v } v& y C R ft for h = ©(log klog 1/e) such that 
£(d z ) < 4£ (dy), and 

e 3 n 2 



|{(u,v) : d y (u,v) > 2d z (u,v)}\ < ^^ . (18) 



Let C2 := 32c' 2 • a ■ \/ i fci i/ e - Then, since £{d z ) < Aa£{d^.) by theorem's assumption we get 

Xk ~ C ' 2 • ( e /2)(W (4) ' 

By Proposition 4.5, there are 2k sets Si, . . . , S2k such that diam(Sj, d z ) < 5/ A for 1 < i < 2A:, and 
they cover (1 — e/2)n vertices of G. By (18), we can apply Lemma 4.4 with d = d z ,d' = d y , we get 
2k sets Ti, . . . , T2& with diameter diam(Tj, dy) < <5 that cover 

2fc 

(1 - e/2)| (J Si I > (1 - e/2)(l - e/2)n > (1 - e)n 

vertices of G. Finally using (17) we get that the dimeter of each set Tj is at most 5 w.r.t. d 2 , 

diam(Tj,d 2 ) < diam(Tj,d y ) < 5. 

Observe that all steps of the analysis are constructive, so we have a polynomial time randomized 
algorithm for finding T\ , . . . , T2k ■ D 
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5 The Rounding Algorithms 

Linial, London, Robinovich [LLR95] observed that by defining a Frechet embedding we can round 
any metric d(., .) into a set of small expansion. 

Fact 5.1. For any metric d(., .) that satisfies the triangle inequality, and any set U C V , there is 
a polynomial time algorithm that finds a set of expansion 

E {u , v )eE\d(u,U)-d(v,U)\ 
^■Eu,Jd(u,U)-d(v,U)\ 

Leighton, Rao [LR99] and Arora, Rao and Vazirani [ARV09] used the above simple fact to show 
the following preprocessing step. 

Fact 5.2 ([ARV09]). There is a polynomial time algorithm that given a feasible solution {x v } v£ y 
to the ARV relaxation (I) of cost sdp either finds a cut of expansion O(sdp), or finds a set W that 
is (4, l/16)-well spread. 

Observe that if W is a (4, l/16)-well spread set as in the conclusion of Fact 5.2, and A is obtained 
from W by removing n/128 vertices from W, then A is still (4, l/32)-well spread. Let e± := 1/128. 
Therefore, given 2k disjoint sets T±, . . . ,T^ covering (1 — e\)n vertices of G, and a (4, l/16)-well 
spread set W as in the conclusion of Fact 5.2, the set A := W n (U^jTj) is (4, l/32)-well spread. 
Furthermore, we can write A as A = A\ U A<i U . . . U A^h where Ai := W U Tj. 

So, in the rest of this section we assume A is (4, 1/32) well spread and all of its vertices are 
covered by 2k sets A\ , . . . , Aik of diameter 8 according to <i x . 

5.1 Rounding ARV 

Given a feasible solution to (1) that has a well-spread subset of vertices, by Fact 5.1 finding a good 
rounding reduces to finding a set U C A that is well-separated from the rest of the vertices of A. 

Fact 5.3. Let {x v } v& v be a feasible solution to (1) of cost sdp, and suppose that A C V is a 
(0(1), 0,(1)) -well spread set. Then there is a polynomial time algorithm that given {x v } ve y and a 
subset U C A, finds a cut of expansion at most 

o(i) ■ n2 • sdp 



Z u ,vzAK(v,u)-dUu,u)\- 

Arora et al. [ARV09] prove that a set U such that Y. u ,v&A l d xK u ) ~ ^xK u )\ > fi(n 2 /\/logn) 
always exists and can be found in polynomial time. This is best possible in general. 

Suppose, however, that {x v } vG a concentrate in just k distinct points. Then let C C A be a set 
of "representatives," that is \C\ = k and we have that for every v £ A there is a unique u £ C such 
that x„ = x u . Let also w(u) be the number of vertices v £ A such that x„ = x u . The condition 
that A is well-spread can be written as 

Y^ w(u) ■ w(v) • I [dl(u,v) > Q(l)] > Q(n 2 ) 

u,v€C 

and we are looking for a subset t/CC such that ^ u veC w(u)w(v)\d^(v, U)—d^(u, U)\ > fl(n 2 / \/log k). 
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Lemma 5.4 (ARV with weights). Let {xu}u€C be a sequence of vectors assigned to vertices of C 
that form a metric of negative type. For any set of weights w : C — > R+, if 

y w(u)w(v)I [d x (u, v) > A] = a ■ n , 

u.v 

then there is a set U C C such that 

Y, w(u)w(v)\d 2 x (u, U) - di(v, U)\ > a ■ A • n 2 /y^g~\C\. 

Proof. We use the following lemma from Arora, Lee, Naor 

Lemma 5.5 (Arora et al. [ALN08]). There exist constants C5 > 1 and < p\ < 1/2 such that for 
sequence of vectors {x v } v eC that form a negative type metric, and every A > ; the following holds. 
There exists a distribution \x over subsets U C C such that for every u,v E C with d^(u, v) > A, 



u&U and d\{v,U) > 



A 



>Pi- 



C 5 y/l0g\X\ 

We show that with a constant probability a random set U satisfies the lemma. Define 

A 



X uv :— I 



dx(u, v) > A and u eU and d£(v, U) > 



c 5y /log\X\ 



Then, by above lemma, 



E 



y~]w(u)w(v)X UtV 



pi Y, w(u)w(v)I [d(u, v) > A] = pi ■ a ■ 



n 2 . 



Since ^2 U v w{u)w{v)X U)V < a ■ n 2 with probability 1, we get 

Y w(u)w(v)X u>v >pi-a-n 2 >p 1 . 

Therefore, with probability p\ we get a set U such that 

Pl ■ a ■ n 2 < Y,™(u) W (v)X u , v < j;w(u)w(v)^ U) ~^ V ' U)l 

D 

Proof of Theorem 3.3. Wlog we assume that A is a (4, 1/32) well spread and all of its vertices 
are covered by 2k disjoint sets A\, . . . ,^2fc of diameter 5 < c^/\/logk according to d\. For a = 
1/256, A = 1/64, we have 

J2 l[dl(u,v)>A]>a-n 2 . (19) 

u,vdA 
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Choose a vertex aj as a center from each set Ai, and let C := {a\, . . . , 02fc}- Also, let w(a>i) = 
\Ai\. Let c : A — >■ C, where for each vertex v £ A, if v £ Ai, then c(a) = a%. Then, for 5 < A/4, 

Y I [d 2 x (u,v) > A] < £ I [4(c(«),c(t;)) > A/2] = ^ ti^M*)! [<£(«, i,) > A/2] . (20) 

where in the first inequality we used the fact that d 2 , is a metric. Putting (19) and (20) together 
we can apply Lemma 5.4 and we get a set U C A such that 

/ 2 

Cn ■ 7T 



Y w(u)w(v)\dl(u, U) - dl(v, U)\ > -j==, (21) 



ti.uec 



where c 3 > is a universal constant. Now choose C3 = min^^ydog k, A/4). We get, 

Y \dl(u,U)-dl(v,U)\ > Y, \dl(c(u),U)-dl(c(v),U)\-\A\J24(v,c(v)) 

u,v£A u,vdA vdA 

where the last inequality follows by (21). The theorem follows by an application of Fact 5.3. □ 

5.2 Rounding Sherali-Adams Relaxations 

Proof of Theorem 3.4- Wlog we assume that A is a (4, 1/32) well spread and all of its vertices are 
covered by 2k disjoint sets A\, . . . , A2k of diameter 5 < C4 according to d 2 ,. We choose a vertex a, 
arbitrarily as the center of Ai, and define C := {ai, . . . ,02k} Also, let c : A — >• C, where for each 
v £ A, if v E A4, then c(a) = ai. 

Let R = C, and pick a random b : R — > {0,1} with probability p ' , and define T>{u,v) := 
-j^d R ' b (u,v). Then, we get (4) and (5), i.e., ^2 U v T>(u, v) = n 2 , and for all pairs u, v G V, 
K[D(u, v)] = d\{u,v). By linearity of expectation and Markov inequality, with probability 1/65 we 
get, 

Y V{u,v) < (l-l/64)n 2 . 

u,v£V:u<£A or vfA 

Thus, Y2uveA ~D(u,v) > n 2 /64. By (4), union bound and Markov inequality we get the following: 
there is a universal constants C4 := 100 • C4, such that with a constant probability we get a metric 
d(-, •) satisfying all the following conditions: 

Y d ( u ' v> > ~ Y^ d l( u > v ) 

(u,v)eE (u,v)<=E 

Y d (u,v) > n 2 /U. (22) 

u,v£A 

Y d ( u , c ( u )) < c' 4 -\A\ (23) 

uGA 
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Furthermore, d(., .) is an integral cut on C, i.e., for all three vertices u,v,w £ C there are two 
whose distance is zero w.r.t. d(., .). Define the Frechet embedding 

f(v) :=d(ai,v) 

Then since d(., .) is a metric, we have 

£ |/(u)-/(«)|< Yl <^)£ £ d xK^) 
(u,v)eE (u,v)eE (u,v)eE 

We show that XmueA \f( u ) ~ f( v )\ ^ c 4 n2 f° r a sufficiently small C4 > 0, and this completes the 
proof using Fact 5.1. 

Claim 5.6. For every pair of vertices u,v £ A, 

\d(a\,u) — d(a\, v)\ > d(u, v) — 2d(u, c(u)) — 2(y, c(v)). 

Proof. We consider two cases, i) d(c(u), c(v)) = ii) d(c(u),a\) = 0. Since d(., .) defines an integral 
cut on C one of the two cases would happen without loss of generality. 

case i) This case follows from the fact that the LHS is always non-negative but the RHS 
is less than or equal to zero. This is because \d(ai,u) — d(ai,v)\ > but by triangle inequality 
d(u, v) < d(u, c(u)) + d(v, c(v)). 

case ii) This case follows by the assumption that d(., .) satisfies triangle inequality. 

\d(ai,u) — d(ai,v)\ = \d(c(u),u) — d(c(u),v)\ > d(v,c(u)) — d(u,c(u)) > d(u,v) — 2d(u,c(u)). 

D 
Therefore, 

Y, \f( u ) - f( v )\ ^ Yl d ( u > w ) ~ 2 ' yl i Yl d ( u > c ( u ^ - n2 / 64 ~ 2c 4 ■ n2 - c 4 n2 - 

u,v£A u,vdA udA 

where the second inequality follows by (22) and (23), and the last inequality follows by letting 
c' 4 < 1/256, and c 4 < 1/25600. □ 
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